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Description 

Prnw-Bef erpnnB to R elat e d A pplications 

This application is a continuation-in-part of 
U.S. Patent Application serial No. 08/164,301, filed 
December 8, 1993, which is a continuation-in-part of U.S. 
Application Serial No. 08/123,732, filed September 20, 
1993, from which priority is claimed pursuant to 35 
U.S.C. § 120, and which disclosures are hereby 
incorporated by reference in their entireties. 

Technic al Field 

The present invention relates generally to 
polyketides and polyketide synthases. In particular, the 
invention pertains to the recombinant production of 
polyketides using a novel host-vector system. 

TVirVjrri-m-* ^ Tnvention 

Polyketides are a large, structurally diverse 
family of natural products. Polyketides possess a broad 
range of biological activities including antibiotic and 
pharmacological properties. For example, polyketides are 
represented by such antibiotics as tetracyclines and 
erythromycin, anticancer agents including daunomycin, 
immunosuppressants, for example FK506 and rapamycin, and 
veterinary products such as monensin and avermectm. 
Polyketides occur in most groups of organisms and are 
especially abundant in a class of mycelial bacteria, the 
actinomycetes, which produce various polyketides. 
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Polyketide synthases (PKSs) are multifunctional 
enzymes related to fatty acid synthases (FASs) . PKSs 
catalyze the biosynthesis of polyketides through repeated 
(decarboxylative) Claisen condensations between 
acylthioesters, usually acetyl, propionyl, malonyl or 
methylmalonyll Following each condensation, they 
introduce structural variability into the product by 
catalyzing all, part, or none of a reductive cycle 
comprising a ketoreduction, dehydration, and 
enoylreduction on the 6-keto group of the growing 
polyketide chain. After the carbon chain has grown to a 
length characteristic of each specific product, it is 
released from the synthase by thiolysis or acyltransf er . 
Thus, PKSs consist of families of enzymes which work 
together to produce a given polyketide. It is the 
controlled variation in chain length, choice of 
chain-building units, and the reductive cycle, 
genetically programmed into each PKS, that contributes to 
the variation seen among naturally occurring polyketides. 

Two general classes of PKSs exist. One class, 
known as Type I PKSs, is represented by the PKSs for 
macrolides such as erythromycin. These "complex 11 or 
"modular" PKSs include assemblies of several large 
multifunctional proteins carrying, between them, a set of 
separate active sites for each step of carbon chain 
assembly and modification (Cortes, J. et al. Nature 
(1990) 348:1*76; Donadio, S. et al. Science (1991) 
252 :675; MacNeil, D.J. et al. Gene (1992) 115 :119) . 
Structural diversity occurs in this class from variations 
in the number and type of active sites in the PKSs. This 
class of PKSs displays a one-to-one correlation between 
the number and clustering of active sites in the primary 
sequence of the PKS and the structure of the polyketide 
backbone. 
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The second class of PKSs, called Type II PKSs, 
is represented by the synthases for aromatic compounds. 
Type II PKSs have a single set of iteratively used active 
sites (Bibb, M.J. et al. EHBO J*. (1989) 8:2727; Sherman, 
D.H. et-al. EMBO J. (1989) 8:2717; Fernandez -Moreno, M. A* 
et al. J". Biol. Chem. (1992) 267:19278). 

Streptomyces is an actinomycete which is an 
abundant producer of aromatic polyketides. In each 
Streptomyces aromatic PKS so far studied, carbon chain 
assembly requires the products of three open reading 
frames (ORFs) . 0RF1 encodes a ketosynthase (KS) and an 
acyltransf erase (AT) active site; ORF2 encodes a protein 
similar to -the ORF1 product but lacking the KS and AT 
motifs; and ORF3 encodes a discrete acyl carrier protein 
(ACP) . 

Streptomyces coelicolor produces the 
blue-pigmented polyketide, actinorhodin . The 
actinorhodin gene cluster (act) , has been cloned 
(Malpartida, F. and Hopwood, D.A. Nature (1984) 309 :462; 
Malpartida, F. and Hopwood, D.A. Mol . Gen. Genet. (1986) 
205 : 66) and completely sequenced (Fernandez-Moreno, M.A. 
et al. J. Biol. Che/n. (1992) 267:19278; Kalian, S.E. et 
al. Gene (1988) 74:305; Fernandez-Moreno, M.A, et al. 
Cell (1991) 66:769; Caballero, J. et al. Mol. Gen. Genet. 
(1991) 230 :401) . The cluster encodes the PKS enzymes 
described above, a cyclase and a series of tailoring 
enzymes involved in subsequent modification reactions 
leading to actinorhodin, as well as proteins involved in 
export of the antibiotic and at least one protein that 
specifically activates transcription of the gene cluster. 
Other genes required for global regulation of antibiotic 
biosynthesis, as well as for the supply of starter 
(acetyl CoA) and extender (malonyl CoA) units for 
polyketide biosynthesis, are located elsewhere in the 
genome. 
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The act gene cluster from S. coelicolor has 
been used to produce actinorhodin in S. parvulus. 
Malpartida, F. and Hopwood, D.A. Nature (1984) 309 :462. 
Bartel et al. J. Bacterid. (1990) 172:4816-4826, 
recombinantly produced aloesaponarin II using S. 
galilaeus transformed with an S. coelicolor act gene 
cluster consisting of four genetic loci, actl, actlll, 
actIV and actvil. Hybrid PKSs, including the basic act 
gene set but with ACP genes derived from granaticin, 
oxy tetracycline, tetracenomycin and frenolicin PKSs, have 
also been designed which are able to express functional 
synthases. Khosla, C. et al. J\ Bacterid . (1993) 
175:2197-2204. Hopwood, D.A. et al. Nature (1985) 
314 :642-644, describes the production of hybrid 
polyketides, using recombinant techniques. Sherman, D. H. 
et al. J . Bacterid. (1992) 174.: 6184-6190, reports the 
transformation of various S. coelicolor mutants, lacking 
different components of the act PKS gene cluster, with 
the corresponding granaticin (grra) genes from S. 
violaceoruber, in trans. 

However, no one to date has described the 
recombinant production of polyketides using genetically 
engineered host cells which substantially lack their 
entire native PKS gene clusters. 

Summary of the Invention 

The present invention provides for novel 
polyketides and novel methods of efficiently producing 
both new and known polyketides, using recombinant 
technology. In particular, a novel host-vector system is 
used to produce PKSs which in turn catalyze the 
production of a variety of polyketides. Such polyketides 
are useful as antibiotics, antitumor agents, 
immunosuppressants and for a wide variety of other 
pharmacological purposes. 
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Accordingly, in one embodiment, the invention 
is directed to a genetically engineered cell which 
expresses a polyketide synthase (PKS) gene cluster in its 
native, nontransf ormed state, the genetically engineered 
cell substantially lacking the entire native PKS gene 
cluster. 

In another embodiment, the invention is 
directed to the genetically engineered cell as described 
above, wherein the cell comprises: 

(a) a replacement PKS gene cluster which 
encodes a PKS capable of catalyzing the synthesis of a 
polyketide; and 

(b) one or more control sequences operatively 
linked to the PKS gene cluster, whereby the genes in the 
gene cluster can be transcribed and translated in the 
genetically engineered cell, 

with the proviso that when the replacement PKS 
gene cluster comprises an entire PKS gene set, at least 
one of the PKS genes or control elements is heterologous 
to the 'cell. 

In particularly preferred embodiments, the 
genetically engineered cell is Streptomyces coelicolor, 
the cell substantially lacks the entire native 
actinorhodin PKS gene cluster and the replacement PKS 
gene cluster comprises a first gene encoding a PKS 
ketosynthase and a PKS acyltransf erase active site 
(KS/AT) , a second gene encoding a PKS chain length 
determining factor (CLF) , and a third gene encoding a PKS 
acyl carrier protein (ACP) . 

In another embodiment, the invention is 
directed to a method for producing a recombinant 
polyketide comprising: 

(a) providing a population of cells as 
described above; and 
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(b) culturing the population of cells under 
conditions whereby the replacement PKS gene cluster 
present in the cells, is expressed. 

In still another embodiment, the invention is 
directed to a method for producing a recombinant 
polyketide comprising: 

a. inserting a first portion of a replacement 
PKS gene cluster into a donor plasmid and inserting a 
second portion of a replacement PKS gene cluster into a 
recipient plasmid, wherein the first and second portions 
collectively encode a complete replacement PKS gene 
cluster, and further wherein: 

i. the donor plasmid expresses a gene 
which encodes a first selection marker and is capable of 
replication at a first, permissive temperature and 
incapable of replication at a second, non-permissive 
temperature; 

ii. the recipient plasmid expresses a 
gene which encodes a second selection marker; and 

iii. the donor plasmid comprises regions 
of DNA complementary to regions of DNA in the recipient 
plasmid, such that homologous recombination can occur 
between the first portion of the replacement PKS gene 
cluster and the second portion of the replacement gene 
cluster, whereby a complete replacement gene cluster can 
be generated; 

b. transforming the donor plasmid and the 
recipient plasmid into a host cell and culturing the 
transformed host cell at the first, permissive 
temperature and under conditions which allow the growth 
of host cells which express the first and/or the second 
selection markers, to generate a first population of 
cells ; 

c. culturing the first population of cells at 
the second, non-permissive temperature and under 
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conditions which allow the growth of cells which express 
the first and/ or the second selection markers, to 
generate a second population of cells which includes host 
cells which contain a recombinant plasmid comprising a 
5 complete PKS replacement gene cluster; 

d. transferring the recombinant plasmid from 
the second population of cells into the genetically 
engineered cell of claim 1 to generate transformed 
genetically engineered cells; and 
10 e. culturing the transformed genetically 

CI engineered cells under conditions whereby the replacement 

PKS gene cluster present in the cells is expressed. 



In yet another embodiment , the invention is 
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15 directed to a polyketide compound having the structural 
fy| ' formula (I) 
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wherein: 

R 1 is selected from the group consisting of 
hydrogen and lower alkyl and R 2 is selected from the 
group consisting of hydrogen, lower alkyl and lower alkyl 
ester , or wherein R 1 and R 2 together form a lower 
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alkylene bridge optionally substituted with one to four 
hydroxyl or lower alkyl groups; 

R 3 and R 5 are independently selected from the 
group consisting of hydrogen, halogen, lower alkyl, lower 
alkoxy, * amino, lower alkyl mono- or di-substituted amino 
and nitro; 

R 4 is selected from the group consisting of 
halogen, r lower alkyl, lower alkoxy, amino, lower alkyl 
mono- or di-substituted amino and nitro; 

R 6 is selected from the group consisting of 
hydrogen, lower alkyl, and -CHR 7 -(CO)R 8 where R 7 and R 8 
are independently selected from the group consisting of 
hydrogen and lower alkyl; and 

i is 1, 2 or 3. 

In another embodiment, the invention related to 
novel polyketides having the structures 



HO' 




SEK4 (12) 
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Ha 




RM20C (15) 

or 



OH O OH 




SEK15b (16) 

In another embodiment, the invention is 
directed to a polyketide compound formed by catalytic 
cyclization of an enzyme-bound ketide having the 
structure (II) 
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wherein: 

R 11 is selected from the group consisting of 
methyl, -CH 2 (CO)CH 3 and -CH 2 (CO) CH 2 (CO) CH 3 ; 

R 12 is selected from the group consisting of 
-S-E and -CH 2 (CO) -S-E, wherein E represents a polyketide 
synthase produced by the genetically engineered cells 
above; and 

one of R 13 and R 14 is hydrogen and the other is 
hydroxyl, or R 13 and R 14 together represent carbonyl. 

In still another embodiment, the invention is 
directed to a method for producing an aromatic 
polyketide, comprising effecting cyclization of an 
enzyme-bound ketide having the structure (II) , wherein 
cyclization is induced by the polyketide synthase. 

In a further embodiment, the invention is 
directed to a polyketide compound having the structural 
formula (III) 
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wherein R 2 , R 4 and i are as defined above for structural 
formula (III). 
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In still anther embodiment, the invention is 
directed to a polyketide compound having the structural 
formula (V) * 



OR 2 O OR 2 




wherein R 2 , R 4 and i are as defined above for structural 
formula (III) . 

These and other embodiments of the subject 
invention will readily occur to those of ordinary skill 
in the art in view of the disclosure herein. 

Brief Description of the Figures 

Figure 1 shows the gene clusters for act, gra, 
and tern PKSs and cyclases. 

Figure 2 shows the strategy for making S. 
coelicolor CH999. Figure 2A depicts the structure of the 
act gene cluster present on the S. coelicolor CHI 
chromosome. Figure 2B shows the structure of pLRemEts 
and Figure 2C shows the portion of the CH999 chromosome 
with the act gene cluster deleted. 

Figure 3 is a diagram of plasmid pRM5. 

Figure 4 schematically illustrates formation of 
aloesaponarin II (2) and its carboxylated analog, 3,8- 
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dihydroxy-l-methylanthraquinone-2-carboxylic acid (1) as 
described in Example 3, 

Figure 5 provides the structures of 
actinorhodin (3), granaticin (4), tetracenomycin (5) -and 
mutactin (6),, referenced in Example 4. 

Figure 6 schematically illustrates the 
preparation, via cyclization of the polyketide 
precursors, of aloesaponarin II (2) , its carboxylated 
analog, 3 , 8~dihydroxy-l-methylanthraquinone-2-carboxylic 
acid (1), tetracenomycin (5) and new compound RM20 (9), 
as explained in Example 4, part (A). 

Figure 7 schematically illustrates the 
preparation, via cyclization of the polyketide 
precursors, of frenolicin (7), nanomycin (8) and 
actinorhodin (3). 

Figure 8 schematically illustrates the 
preparation, via cyclization of the polyketide 
precursors, of novel compounds RM20 (9) , RM18 (10) , RM18b 
(11), SEK4 (12), SEK15 (13), RM2 0b (14), RM20c (15) and 
SEK15b (16). 

Figure 9 depicts the genetic model for the 
6-deoxyerythronolide B synthase (DEBS) . 

Figure 10 shows the strategy for the 
construction of recombinant modular PKSs. 

Figure 11 is a diagram of plasmid pCK7. 

Detailed Description of the Invention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional methods 
of chemistry, microbiology, molecular biology and 
recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. 
See, e.g., Sambrook, et al . Molecular Cloning: A 
Laboratory Manual (Current Edition) ; DNA Cloning: A 
Practical Approach, vol. I & II (D. Glover, ed.); 
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Oligonucleotide Synthesis (N. Gait, ed. , Current 
Edition); Nucleic Acid Hybridization (B. Hames & 
Higgins, eds. , • Current Edition); Transcription and 
Translation (B. Hames & S. Higgins, eds., Current 
Edition) . 

All publications, patents and patent 
applications cited herein, whether supra or infra, are 
hereby Incorporated by reference in their entirety. 

As used in this specification and the appended 
claims, the singular forms "a," "an" and "the" include 
plural references unless the content clearly dictates 
otherwise. Thus, reference to "a polyketide" includes 
mixtures of polyketides, reference to "a polyketide 
synthase" includes mixtures of polyketide synthases, and 
the like. 

A. Definitions 

In describing the present invention, the 
following terms will be employed, and are intended to be 
defined as indicated below. 

By "replacement PKS gene cluster" is meant any 
set of PKS genes capable of producing a functional PKS 
when under the direction of one or more compatible 
control elements, as defined below, in a host cell 
transformed therewith. A functional PKS is one which 
catalyzes the synthesis of a polyketide. The term 
"replacement PKS gene cluster" encompasses one or more 
genes encoding for the various proteins necessary to 
catalyze the production of a polyketide. A "replacement 
PKS gene cluster" need not include all of the genes found 
in the corresponding cluster in nature. Rather, the gene 
cluster need only encode the necessary PKS components to 
catalyze the production of an active polyketide. Thus, 
as explained further below, if the gene cluster includes, 
for example, eight genes in its native state and only 
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three of these genes are necessary to provide an active 
polyketide, only these three genes need be present* 
Furthermore, the cluster can include PKS genes derived 
from a single species, or may be hybrid in nature with, 
e.g., a gene derived from a cluster for the synthesis of 
a particular polyXetide replaced with a corresponding 
gene from a cluster for the synthesis of another 
polyketide. Hybrid clusters can include genes derived 
from both Type I and Type II PKSs. As explained above, 
Type I PKSs include several large multifunctional 
proteins carrying, between them, a set of separate active 
sites for each step of carbon chain assembly and 
modification. Type II PKSs, on the other hand, have a 
single set of iteratively used active sites. These 
classifications are well known. See, e.g., Hopwood, D.A. 
and Khosla, C. Secondary metabolites : their function and 
evolution (1992) Wiley Chichester (Ciba Foundation 
Symposium 171) p 88-112; Bibb, M.J. et al. EtiBO J. (1989) 
8:2727/ Sherman, D.H. et al. EHBO J. (1989) 8:2717; 
Fernandez-Moreno, M.A. et al. J. Biol. Chem. (1992) 
267 :19278) ; Cortes, J. et al. Nature (1990) 348:176; 
Donadio, S. et al. Science (1991) 252:675; MacNeil, D.J. 
et al. Gene (1992) 115:119. Hybrid clusters are 
exemplified herein and are described further below. The 
genes included in the gene cluster need not be the native 
genes, but can be mutants or analogs thereof. Mutants or 
analogs may be prepared by the deletion , insertion or 
substitution of one or more nucleotides of the coding 
sequence. Techniques for modifying nucleotide sequences, 
such as site-directed mutagenesis, are described in, 
e.g., Sambrook et al . , supra; DNA Cloning, Vols. I and 
II, supra; Nucleic Acid Hybridization, supra, 

A "replacement PKS gene cluster 11 may also 
contain genes coding for modifications to the core 
polyketide catalyzed by the PKS, including, for example, 
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genes encoding hydroxylases, methylases or other 
alkylases, oxidases, reductases, glycotransf erases, 
lyases, ester or amide synthases, and various hydrolases 
such as esterases and amidases. 

As explained further below, the genes included 
in the replacement gene cluster need not be on the same 
plasmid or if present on the same plasmid, can be 
controlled by the same or different control sequences. 

By "genetically engineered host cell" is meant 
a host cell where the native PKS gene cluster has been 
deleted using recombinant DNA techniques. Thus, the term 
would not encompass mutational events occurring in 
nature. A "host cell" is a cell derived from a 
procaryotic microorganism or a eucaryotic cell line 
cultured as a unicellular entity, which can be, or has 
been, used as a recipient for recombinant vectors bearing 
the PKS gene clusters of the invention. The term 
includes the progeny of the original cell which has been 
transfected. It is understood that the progeny of a 
single parental cell may not necessarily be completely 
identical in morphology or in genomic or total DNA 
complement to the original parent, due to accidental or 
deliberate mutation. Progeny of the parental cell which 
are sufficiently similar to the parent to be 
characterized by the relevant property, such as the 
presence of a nucleotide sequence encoding a desired PKS, 
are included in the definition, and are covered by the 
above terms. 

The term "heterologous" as it relates to 
nucleic acid sequences such as coding sequences and 
control sequences, denotes sequences that are not 
normally associated with a region of a recombinant 
construct, and/or are not normally associated with a 
particular cell. Thus, a "heterologous" region of a 
nucleic acid construct is an identifiable segment of 
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nucleic acid within or attached to another nucleic acid 
molecule that is not found in association with the other 
molecule in nature. For example, a heterologous region 
of a construct* could include a coding sequence flanked by 
sequences not found in association with the coding 
sequence in nature. Another example of a heterologous 
coding sequence is a construct where the coding sequence 
itself is not found in nature (e.g., synthetic sequences 
having codons different from the native gene) . 
Similarly, a host cell transformed with a construct which 
is not normally present in the host cell would be 
considered heterologous for purposes of this invention. 
Allelic variation or naturally occurring mutational 
events do not give rise to heterologous DNA, as used 
herein. 

A "coding sequence" or a sequence which 
"encodes" a particular PKS, is a nucleic acid sequence 
which is transcribed (in the case of DNA) and translated 
(in the case of mRNA) into a polypeptide in vitro or in 
vivo when placed under the control of appropriate 
regulatory sequences. The boundaries of the coding 
sequence are determined by a start codon at the 5' 
(amino) terminus and a translation stop codon at the 3' 
(carboxy) terminus. A coding sequence can include, but 
is not limited to, cDNA from procaryotic or eucaryotic 
mRNA, genomic DNA sequences from procaryotic or 
eucaryotic DNA, and even synthetic DNA sequences. A 
transcription termination sequence will usually be 
located 3' to the coding sequence. 

A "nucleic acid" sequence can include, but is 
not limited to, procaryotic sequences, eucaryotic mRNA, 
cDNA from eucaryotic mRNA, genomic DNA sequences from 
eucaryotic (e.g., mammalian) DNA, and even synthetic DNA 
sequences. The term also captures sequences that include 
any of the known base analogs of DNA and RNA such as, but 
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not limited to 4-acetylcytosine, 8-hydroxy-N6- 
roethyladenosine, aziridinylcytosine, pseudoisocytosine, 
5- (carboxyhydroxylmethyl ) uracil , 5-f luorouracil , 
5-bromouracil , 5-carboxyiriethylaminomethyl-2-thiouracil , 
5-carboxymethylaroinomethyluracil, dihydrouracil, inosine, 
N6-isopentenyladenine, 1-methyladenine, 1-methylpseudo- 
uracil , l-methylguanine, 1-methylinosine , 2 , 2-dimethyl- 
guanine , 2 -methyladenine , 2 -methy Iguanine , 3 -methyl- 
cytosine, 5-methylcytosine, NS-methyladenine, 
7 -methy Iguanine, 5-methylaminomethyluracil, 5-methoxy- 
aminomethyl-2-thiouracil, beta-D-mannosylqueosine, 
5'-methoxycarbonylmethyluracil, 5-methoxyuracil, 
2-methylthio~N6-isopentenyladenine / uracil-5-oxyacetic 
acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, 
pseudouracil, queosine, 2-thiocytosine, 5-methyl- 
2-thiouracil, 2-thiouracil , 4-thiouracil , 5-nethyluracil , 
N-uracil-5-oxyacetic acid methylester , uracil-5-oxyacetic 
acid, pseudouracil, queosine, 2-thiocytosine, and 
2 , 6-diaminopurine. A transcription termination sequence 
will usually be located 3 ' to the coding sequence. 

DNA "control sequences" refers collectively to 
promoter sequences, ribosome binding sites, 
polyadenylation signals, transcription termination 
sequences, upstream regulatory domains, enhancers, and 
the like, which collectively provide for the 
transcription and translation of a coding sequence in a 
host cell. Not all of these control sequences need 
always be present in a recombinant vector so long as the 
desired gene is capable of being transcribed and 
translated. 

"Operably linked 1 ' refers to an arrangement of 
elements wherein the components so described are 
configured so as to perform their usual function. Thus, 
control sequences operably linked to a coding sequence 
are capable of effecting the expression of the coding 
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sequence. The control sequences need not be contiguous 
with the coding sequence , so long as they function to 
direct the expression thereof. Thus, for example, 
intervening untranslated yet transcribed sequences can be 
present -between a promoter sequence and the coding 
sequence and the promoter sequence can still be 
considered "operably linked 11 to the coding sequence. 

By "selection marker" is meant any genetic 
marker which can be used to select a population of cells 
which carry the marker in their genome. Examples of 
selection markers include: auxotrophic markers by which 
cells are selected by their ability to grow on minimal 
media with or without a nutrient or supplement, e.g., 
thymidine, diaminopimelic acid or biotin; metabolic 
markers by which cells are selected for their ability to 
grow on minimal media containing the appropriate sugar as 
the sole carbon source or the ability of cells to form 
colored colonies containing the appropriate dyes or 
chromogenic substrates; and drug resistance markers by 
which cells are selected by their ability to grow on 
media containing one or more of the appropriate drugs, 
e.g., tetracycline, ampicillin, kanamycin, streptomycin 
or nalidixic acid. 

"Recombination" is a the reassortment of 
sections of DNA sequences between two DNA molecules. 
"Homologous recombination" occurs between two DNA 
molecules which hybridize by virtue of homologous or 
complementary nucleotide sequences present in each DNA 
molecule. 

The term T, alkyl ,r as used herein refers to a 
branched or unbranched saturated hydrocarbon group of 1 
to 24 carbon atoms, such as methyl, ethyl, n-propyl, 
isopropyl, n-butyl, isobutyl, t-butyl, octyl, decyl, 
tetradecyl, hexadecyl, eicosyl, tetracosyl and the like. 
Preferred alkyl groups herein contain 1 to 12 carbon 
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atoms. The term "lower alkyl" intends an alkyl group of 
one to six carbon atoms, preferably one to four carbon 
atoms . 

The term "alkylene" as used herein refers to a 
difunctional saturated branched or unbranched hydrocarbon 
chain containing from 1 to 24 carbon atoms, and includes, 
for example, methylene (-CH 2 -J , ethylene (-CH 2 -CH 2 -) , 
propylene (-CH 2 -CH 2 -CH 2 -) , 2-methylpropylene [-CH 2 - 
CH(CH 3 )-CH 2 -] , hexylene [~(CH 2 ) 6 -] and the like* "Lower 
alkylene" refers to an alkylene group of 1 to 6, more 
preferably 1 to 4, carbon atoms. 

The term "alkoxy" as used herein intends an 
alkyl group bound through a single, terminal ether 
linkage; that is, an "alkoxy" group may be defined as -OR 
where R is alkyl as defined above. A "lower alkoxy" 
group intends an alkoxy group containing one to six, more 
preferably one to four, carbon atoms. 

"Halo" or "halogen" refers to fluoro, chloro, 
bromo or iodo, and usually relates to halo substitution 
for a hydrogen atom in an organic compound. Of the 
halos, chloro and fluoro are generally preferred. 

"Optional" or "optionally" means that the 
subsequently described event or circumstance may or may 
not occur, and that the description includes instances 
where said event or circumstance occurs and instances 
where it does not. For example, the phrase "optionally 
substituted alkylene" means that an alkylene moiety may 
or may not be substituted and that the description 
includes both unsubstituted alkylene and alkylene where 
there is substitution. 

B. General Methods 

Central to the present invention is the 
discovery of a host-vector system for the efficient 
recombinant production of both novel and known 



Docket No. "400-0002 .21 
PATENT 



-22- 



polyketides. In particular, the invention makes use of 
genetically engineered cells which have their naturally 
occurring PKS genes substantially deleted. These host 
cells can be transformed with recombinant vectors, 
encoding a variety of PKS gene clusters, for the 
production of active polyketides. The invention provides 
for the production of significant quantities of product 
at an appropriate stage of the growth cycle. The 
polyketides so produced can be used as therapeutic 
agents, to treat a number of disorders, depending on the 
type of polyketide in question. For example, several of 
the polyketides produced by the present method will find 
use as immunosuppressants, as anti-tumor agents, as well 
as for the treatment of viral, bacterial and parasitic 
infections. The ability to recombinantly produce 
polyketides also provides a powerful tool for 
characterizing PKSs and the mechanism of their actions. 

More particularly, host cells for the 
recombinant production of the subject polyketides can be 
derived from any organism with the capability of 
harboring a recombinant PKS gene cluster. Thus, the host 
cells of the present invention can be derived from either 
procaryotic or eucaryotic organisms. However, preferred 
host cells are those constructed from the actinomycetes, 
a class of mycelial bacteria which are abundant producers 
of a number of polyketides. A particularly preferred 
genus for use with the present system is Streptomyces . 
Thus, for example, 5. ambofaciens , 5. avermitilis , 5. 
azureus, S. cinnamonensis , 5. coelicolor t 5. curacoi, S. 
erythraeus, 5. fradiae, 5. galilaeus , S. glaucescens , S. 
hygroscopicus , S. lividans, S. parvulus , S. peucetius , S. 
rimosuSf s. roseofulvus , s, thermotolerans , S. 
violaceoruber , among others, will provide convenient host 
cells for the subject invention, with 5. coelicolor being 
preferred, (See, e.g., Hopwood, D.A. and Sherman, D.H. 
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Ann. Rev. Genet. (1990) 24:37-66; O'Hagan, D . The 
Polyketide Metabolites (Ellis Horwood Limited, 1991) , for 
a description of various polyketide-producing organisms 
and their natural products.) 

The above-described cells are genetically 
engineered by deleting the naturally occurring PKS genes 
therefrom, using standard techniques, such as by 
homologous recombination, (See, e.g., Khosla, C. et al. 
Molec. Microbiol. (1992) .6:3237). Exemplified herein is 
a genetically engineered 5. coelicolor host cell. Native 
strains of S. coelicolor produce a PKS which catalyzes 
the biosynthesis of the aromatic polyketide actinorhodin 
(structure 3, Figure 5). The novel strain, S. coelicolor 
CH999 (Figure 2C and described in the examples) , was 
constructed by deleting/ via homologous recombination, 
the entire natural act cluster from the chromosome of S. 
coelicolor CHI (Khosla, C. Molec. Microbiol. (1992) 
6:3237) , a strain lacking endogenous plasmids and 
carrying a stable mutation that blocks biosynthesis of 
another pigmented S. coelicolor antibiotic, 
undecylprodigiosin. 

The host cells described above can be 
transformed with one or more vectors, collectively 
encoding a functional PKS set. The vector (s) can include 
native or hybrid combinations of PKS subunits, or mutants 
thereof. As explained above, the replacement gene 
cluster need not correspond to the complete native gene 
cluster but need only encode the necessary PKS components 
to catalyze the production of a polyketide. For example, 
in each Streptomyces aromatic PKS so far studied, carbon 
chain assembly requires the products of three open 
reading frames (ORFs) . 0RF1 encodes a ketosynthase (KS) 
and an acyltransferase (AT) active site (KS/AT) ; as 
elucidated herein, ORF2 encodes a chain length 
determining factor (CLF) , a protein similar to the 0RF1 
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product but lacking the KS and AT motifs; and ORF3 
encodes a discrete acyl carrier protein (ACP) . Some gene 
clusters also code for a ketoreductase (KR) and a 
cyclase, involved in cyclization of the nascent 
polyketide backbone. (See Figure 1 for a schematic 
representation of three PKS gene clusters,) However, it 
has been found that only the KS / AT , CLF, and ACP, need be 
present, in order to produce an identifiable polyketide. 
Thus, in the case of aromatic PKSs derived from 
Str&ptomyces , these three genes, without the other 
components of the native clusters, can be included in one 
or more recombinant vectors, to constitute a "minimal" 
replacement PKS gene cluster. 

Furthermore, the recombinant vector (s) can 
include genes from a single PKS gene cluster, or may 
comprise hybrid replacement PKS gene clusters with, e.g., 
a gene for one cluster replaced by the corresponding gene 
from another gene cluster. For example, it has been 
found that ACPs are readily interchangeable among 
different synthases without an effect on product 
structure. Furthermore, a given KR can recogni2e and 
reduce polyketide chains of different chain lengths. 
Accordingly, these genes are freely interchangeable in 
the constructs described herein. Thus, the replacement 
clusters of the present invention can be derived from any 
combination of PKS gene sets which ultimately function to 
produce an identifiable polyketide. 

Examples of hybrid replacement clusters include 
clusters with genes derived from two or more of the act 
gene cluster, frenolicin {fren) , granaticin (gxa) , 
tetracenomycin (tern) , S-methylsalicylic acid (6~msas) , 
oxytetracycline (otc) , tetracycline (tet) , erythromycin 
(ery) , griseusin, nanaomycin, medermycin, daunorubicin, 
tylosin, carbomycin, spiramycin, avermectin, monensin, 
nonactin, curamycin, rifamycin and candicidin synthase 
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gere clusters, among others. (For a discussion of 
various PKSs, see, e.g., Hopwood, D. A. and Sherman, D.H. 
Ann. Rev. Genet. (1990) 24.^37-66; O'Hagan, D. The 
Polyketida Metabolites (Ellis Horwood Limited, 1991.) 

Mor,e particularly, a number of hybrid gene 
clusters have been constructed herein, having components 
derived from the act, fren f tern and g/ra gene clusters, as 
depicted- in Tables 1 and 2. Several of the hybrid 
clusters were able to functionally express both novel and 
known polyketides in S. coelicolor CH999 (described 
above). However, other hybrid gene clusters, as 
described above, can easily be produced and screened 
using the disclosure herein, for the production of 
identifiable polyketides. For example, a library of 
randomly cloned ORF l and 2 homologs, from a collection 
of actinomycetes, could be constructed and screened for 
identifiable polyketides. Longer polyketides might also 
be cyclized by replacing, e.g., an act, gra r fren or tern 
cyclase gene with a homolog from a PKS gene cluster which 
produces a chain of the correct length. Finally, a 
considerable degree of variability exists for non-acetate 
starter units among certain naturally occurring aromatic 
PKSs; thus, these units can also be used for obtaining 
novel polyketides via genetic engineering. 

Additionally, the recombinant vectors can 
include genes from a modular PKS gene cluster. Such gene 
clusters are described in further detail below. 

The recombinant vectors, harboring the gene 
clusters described above, can be conveniently generated 
using techniques known in the art. For example, the PKS 
subunits of interest can be obtained from an organism 
that expresses the same, using recombinant methods, such 
as by screening cDNA or genomic libraries, derived from 
cells expressing the gene, or by deriving the gene from a 
vector known to include the same. The gene can then be 
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isolated and combined with other desired PKS subunits, 
using standard techniques. If the gene in question is 
already present in a suitable expression vector, it can 
be combined in situ, with, e.g., other PKS subunits, as 
desired; The gene of interest can also be produced 
synthetically, rather than cloned. The nucleotide 
sequence can be designed with the appropriate codons for 
the particular amino acid sequence desired. In general / 
one will select preferred codons for the intended host in 
which the sequence will be expressed. The complete 
sequence is assembled from overlapping oligonucleotides 
prepared by standard methods and assembled into a 
complete coding sequence. See t e.g., Edge (1981) Nature 
292 :756; Nambair et al . (1984) Science 223 : 1299 ; Jay et 
al. (1984) J. Biol. Chem. 259 : 6311. 

Mutations can be made to the native PKS subunit 
sequences and such mutants used in place of the native 
sequence, so long as the mutants are able to function 
with other PKS subunits to collectively catalyze the 
synthesis of an identifiable polyketide. Such mutations 
can be made to the native sequences using conventional 
techniques such as by preparing synthetic 
oligonucleotides including the mutations and inserting 
the mutated sequence into the gene encoding a PKS subunit 
using restriction endonuclease digestion. (See, e.g., 
Kunkel, T.A. Proc. Natl. Acad. Sci . USA (1985) 82:448; 
Geisselsoder et al . BioTechniques (1987) 5:786.) 
Alternatively, the mutations can be effected using a 
mismatched primer (generally 10-20 nucleotides in length) 
which hybridizes to the native nucleotide sequence 
(generally cDNA corresponding to the RNA sequence) , at a 
temperature below the melting temperature of the 
mismatched duplex. The primer can be made specific by 
keeping primer length and base composition within 
relatively narrow limits and by keeping the mutant base 
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centrally located. Zoller and Smith, Methods Enzymol. 
(1983) 100:468. Primer extension is effected using DNA 
polymerase, the product cloned and clones containing the 
mutated DNA, derived by segregation of the primer 
extended strand, selected. Selection can be accomplished 
using the mutant primer as a hybridization probe. The 
technique is also applicable for generating 
multiple ' point mutations. See, e.g., Dalbie-McFarland et 
al. Proc. Natl. Acad. Sci USA (1982) 79:6409. PCR 
mutagenesis will also find use for effecting the desired 
mutations. 

The gene sequences which collectively encode a 
replacement PKS gene cluster, can be inserted into one or 
more expression vectors, using methods known to those of 
skill in the art. Expression vectors will include 
control sequences operably linked to the desired PKS 
coding sequence. Suitable expression systems for use 
with the present invention include systems which function 
in eucaryotic and procaryotic host cells. However, as 
explained above, procaryotic systems are preferred, and 
in particular, systems compatible with Streptomyces spp. 
are of particular interest. Control elements for use in 
such systems include promoters, optionally containing, 
operator sequences, and ribosome binding sites. 
Particularly useful promoters include control sequences 
derived from PKS gene clusters, such as one or more act 
promoters. However, other bacterial promoters, such as 
those derived from sugar metabolizing enzymes, such as 
galactose, lactose (lac) and maltose, will also find use 
in the present constructs. Additional examples include 
promoter sequences derived from biosynthetic enzymes such 
as tryptophan (trp) , the ^-lactamase (Ma) promoter 
system, bacteriophage lambda PL, and T5. In addition, 
synthetic promoters, such as the tac promoter (U.S. 
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Patent No. 4,551,433), which do not occur in nature also 
function in bacterial host cells. 

Other regulatory sequences may also be 
desirable which allow for regulation of expression of the 
PKS replacement sequences relative to the growth of the 
host cell. Regulatory sequences are known to those of 
skill in the art, and examples include those which cause 
the expression of a gene to be turned on or off in 
response to a chemical or physical stimulus, including 
the presence of a regulatory compound, other types of 
regulatory elements may also be present in the vector, 
for example, enhancer sequences. 

Selectable markers can also be included in the 
recombinant expression vectors. A variety of markers are 
known which are useful in selecting for transformed cell 
lines and generally comprise a gene whose expression 
confers a selectable phenotype on transformed cells when 
the cells are grown in an appropriate selective medium. 
Such markers include, for example, genes which confer 
antibiotic resistance or sensitivity to the plasmid. 
Alternatively, several polyketides are naturally colored 
and this characteristic provides a built-in marker for 
selecting cells successfully transformed by the present 
constructs . 

The various PKS subunits of interest can be 
cloned into one or more recombinant vectors as individual 
cassettes, with separate control elements, or under the 
control of, e.g., a single promoter. The PKS subunits 
can include flanking restriction sites to allow for the 
easy deletion and insertion of other PKS subunits so that 
hybrid PKSs can be generated. The design of such unique 
restriction sites is known to those of skill in the art 
and can be accomplished using the techniques described 
above, such as site-directed mutagenesis and PCR. 
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Using these techniques, a novel plasmid, pRM5, 
(Figure 3 and Example 2) was constructed as a shuttle 
vector for the production of the polyketides described 
herein. Plasmid pRM5 includes the act genes encoding 
the KS/AT (0RF1) , CLF (0RF2) and ACP (ORF3) PKS subunits, 
flanked by Pad, Nsil and Xjbal restriction sites. Thus, 
analogous PKS subunits, encoded by other PKS genes, can 
be easily substituted for the existing act genes. (See, 
e.g., Example 4, describing the construction of hybrid 
vectors using pRM5 as the parent plasmid) . The shuttle 
plasmid also contains the act KR gene (actlll) , the 
cyclase gene (actVII) , and a putative dehydratase gene 
(actIV) , as'well as a ColEI replicon (to allow 
transformation of E. coli) , an appropriately truncated 
SCP2* (low copy number) Streptomyces replicon, and the 
actII-ORF4 activator gene from the act cluster, which 
induces transcription from act promoters during the 
transition from growth phase to stationary phase in the 
vegetative mycelium. pRH5 carries the divergent 
actl/actlll promoter pair. 

Methods for introducing the recombinant vectors 
of the present invention into suitable hosts are known to 
those of skill in the art and typically include the use 
of CaCl 2 or other agents, such as divalent cations and 
DMSO. DNA can also be introduced into bacterial cells by 
electroporation. Once the PKSs are expressed, the 
polyketide producing colonies can be identified and • 
isolated using known techniques. The produced 
polyketides can then be further characterized. 

As explained above, the above-described 
recombinant methods also find utility in the catalytic 
biosynthesis of polyketides by large, modular PKSs. For 
example, 6-deoxyerythronolide B synthase (DEBS) catalyzes 
the biosynthesis of the erythromycin aglycone, 
6-deoxyerythronolide B (17). Three open reading frames 
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(eryAX, eryAII, and eryAIII) encode the DEBS polypeptides 
and span 32 kb in the ery gene cluster of the 
Saccharopolyspora erythraea genome. The genes are 
organized in six repeated units, each designated a 
"module;" Each module encodes a set of active sites 
that, during polyketide biosynthesis, catalyzes the 
condensation of an additional monomer onto the growing 
chain. Each module includes an acyltransf erase (AT), 
0-ketoacyl carrier protein synthase (KS) , and acyl 
carrier protein (ACP) as well as a subset of reductive 
active sites (£-ketoreductase (KR) , dehydratase (DH) , 
enoyl reductase (ER) ) (Figure 9). The number of 
reductive sites within a module corresponds to the extent 
of 0-keto reduction in each condensation cycle. The 
thioesterase (TE) encoded at the end of module appears to 
catalyze lactone formation. 

Due to the large sizes of eryAI, eryAXJ, and 
eryAJJJ, and the presence of multiple active sites, these 
genes can be conveniently cloned into a plasmid suitable 
for expression in a genetically engineered host cell, 
such as CH999, using an in vivo recombination technique . 
This technique, described in Example 5 and summarized in 
Figure 10, utilizes derivatives of the plasmid pMAK705 
(Hamilton et al. (1989) J * Bacterid. 171:4617) to permit 
in vivo recombination between a temperature-sensitive 
donor plasmid, which is capable of replication at a 
first, permissive temperature and incapable of 
replication at a second, non-permissive temperature, and 
recipient plasmid. The eryA genes thus cloned gave pCK7, 
a derivative of pRM5 (McDaniel et al. (19 93) Science 
262:1546). A control plasmid, pCK7f, was constructed to 
carry a frameshift mutation in eryAI. pCK7 and pCK7f 
possess a ColEl replicon for genetic manipulation in E . 
coli as well as a truncated SCP2* (low copy number) 
Streptomyces replicon. These plasmids also contain the 
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divergent actl/actlll promoter pair and actII-0RF4, an 
activator gene, which is required for transcription from 
these promoters and activates expression during the 
transition from growth to stationary phase in the 
vegetative mycelium. High-level expression of PKS genes 
occurs at the onset of stationary phase of mycelial 
growth; the recombinant strains therefore produce 
"reporter" polyketides as secondary metabolites in a 
quasi-natural manner. 

The method described above for producing 
polyketides synthesized by large, modular PKSs may be 
used to produce other polyketides as secondary 
metabolites such as sugars, 0-lactams, fatty acids, 
aminoglycosides, terpinoids, non-ribosomal peptides, 
prostanoid hormones and the like. 

Using the above recombinant methods, a number 
of polyketides have been produced. These compounds have 
the general structure (I) 



(I) 




wherein R 1 , R 2 , R 3 , R 4 , R 5 , R 5 , r 7 , r 8 and i are as 
defined above. One group of such compounds are wherein: 
R 1 is lower alkyl, preferably methyl; R 2 , R 3 and R 6 are 
hydrogen; R 6 is -CHR 7 - (CO) -R 8 ; and i is 0. A second 
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group of such compounds are wherein: R 1 and R 6 are lower 
alkyl, preferably methyl; R 2 , R 3 and R 5 are hydrogen; and 
i is 0. Still a third group of such compounds are 
wherein: R 1 and R 2 are linked together to form a lower 
alkylene bridge -CHR 9 -CHR 10 wherein R 9 and R 10 are 
independently selected from the group consisting of 
hydrogen, hydroxyl and lower alkyl, e.g., -CH 2 -CH0H-; R 3 
and R 5 are hydrogen; R 6 is -CHR 7 - (CO) -R 8 where R 8 is 
hydrogen or lower alkyl, e.g., -CH 2 - (CO) -CH 3 ; and i is 0. 
Specific such compounds include the following compounds 
9, 10 and 11 as follows: 




HO ^ 20 



8 



6 



4 



2 
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OH 




Preparation of compounds 9, 10, 11, 12, 13, 14, 15 and 16 
is effected by cyclization of an enzyme-bound polyketide 
having the structure (II) 



35 
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(II) 

5 




wherein R 11 , R 12 , R 13 and R 14 and E are as defined earlier 
tJ 10 herein. Examples of such compounds include: a first 

© group wherein R 11 is methyl and R 12 is -CH 2 (CO) -S-E; a 

second group wherein R 11 is -CH 2 (CO) CH 3 and R 12 is -S-E; a 
third group wherein R 11 is -CH 2 (CO)CH 3 and R 12 is - 



m 
m 



pi CH 2 (CO)-S-E; and a fourth group wherein R IX is - 

yj 15 CH 2 (CO)CH 2 (CO)CH 3 and R 12 is -CH 2 (C0)-S~E (see Figure 8 

gl for structural exemplification) . 

ts The remaining structures encompassed by generic 

!«3 formula (I) — i.e., structures other than 9, 10 and 11 — 

W may be prepared from structures 9, 10 or 11 using routine 

20 synthetic organic methods well-known to those skilled in 
the art of organic chemistry, e.g., as described by H.O. 
House, Modern Synthetic Reactions , Second Edition (Menlo 
Park, CA: The Ben jamin/Cummings Publishing Company, 
1972) , or by J. March, Advanced Organic Chemistry: 
25 Reactions, Mechanisms and Structure , 4th Ed. (New York; 
Kiley-Interscience, 1992) , the disclosures of which are 
hereby incorporated by reference. Typically, as will be 
appreciated by those skilled in the art, incorporation of 
substituents on the aromatic rings will involve simple 
30 electrophilic aromatic addition reactions. Structures 12 
and 13 may be modified in a similar manner to produce 
polyketides which are also intended to be within the 
scope of the present invention. 
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and general structure (V) 



OR 2 O OR* 




Particularly preferred compounds of structural formulas 
(III), (IV) and (V) are wherein: R 2 is hydrogen and i is 
0. 

C. Experimental 

Below are examples of specific embodiments for 
carrying out the present invention. The examples are 
offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in 
any way. 

Efforts have been made to ensure accuracy with 
respect to numbers used (e.g., amounts, temperatures, 
etc.), but some experimental error and deviation should, 
of course, be allowed for. 

Materials and Methods 
Bacterial strains, plasmids, and culture 
conditions. S. coelicolor CH999 was used as a host for 
transformation by all plasmids. The construction of this 
strain is described below. DNA manipulations were 
performed in Escherichia coli MC1061. Plasmids were 
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passaged through E. coli ET12567 (dam dcm hsdS Cm r ) 
(MacNeil, D.J. J » Bacterid. (1988) 170 :5607) to generate 
unmethylated DNA prior to transformation of S. 
coelicolor. E. coli strains were grown under standard 
conditions. S. coelicolor strains were grown on K2YE 
agar plates (Hopwood, D.A. et ai. Genetic manipulation of 
Streptomyces . A laboratory manual. The John Innes 
Foundation: Norwich, 1985) . 

Manipulation of DNA and organisms. Polymerase 
chain reaction (PCR) was performed using Taq polymerase 
(Perkin Elmer Cetus) under conditions recommended by the 
enzyme manufacturer. Standard in vitro techniques were 
used for DNA manipulations (Sambrook, et al . Molecular 
Cloning: A Laboratory Manual (Current Edition) ) . E. 
coli was transformed with a Bio-Rad E. Coli Pulsing 
apparatus using protocols provided by Bio-Rad. S. 
coelicolor was transformed by standard procedures 
(Hopwood, D.A. et al. Genetic manipulation of 
Streptomyces. A laboratory manual. The John Innes 
Foundation: Norwich, 1985) and transf ormants were 
selected using 2 ml of a 500 mg/ml thiostrepton overlay. 

Construction of plasmids containing recombinant 
BKSs. All plasmids are derivatives of pRM5, described 
below, fren PKS genes were amplified via PCR with 5' and 
3' restriction sites flanking the genes in accordance 
with the location of cloning sites on pRM5 (i.e. 
Pacl-Nsil for 0RF1, Nsil-Xbal for 0RF2, and Xbal-Pstl for 
ORF3) . Following subcloning and sequencing, the 
amplified fragments were cloned in place of the 
corresponding fragments in pRM5 to generate the plasmids 
for transformation. 
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Production and purification of polyketides. 
For initial screening, all strains were grown at 30°c as 
confluent lawns on 10-30 plates each containing 
approximately 3 0 ml of agar medium for 6-3 days. 
Additional plates were made as needed to obtain 
sufficient material for complete characterization. CH999 
was a negative control when screening for potential 
polyketides. The agar was finely chopped and extracted 
with ethyl acetate/ 1% acetic acid or ethyl 
acetate: methanol (4:1}/1% acetic acid. The concentrated 
extract was then flashed through a silica gel (Baker 40 
mm) chromatography column in ethyl acetate/ 1% acetic 
acid. Alternatively, the extract was applied to a 
Florisil column (Fisher Scientific) and eluted with ethyl 
acetate: ethanolracetic acid (17:2:1). The primary yellow 
fraction was further purified via high-performance liquid 
chromatography (HPLC) using a 20-60% 
acetonitrile/water/l% acetic acid gradient on a 
preparative reverse phsse (C-18) column (Beckman) . 
Absorbance was monitored at 280nm and 410nm. In general, 
the yield of purified product from these strains was 
approximately 10 mg/1 for compounds 1 and 2 (Figure 4), 
and 5 mg/1 for compounds 7 and 8 (Figure 7) . 

SEK4, (12), was produced and purified as 
follows. CH999/pSEK4 was grown on 90 agar plates (- 34 
ml/plate) at 30°C for 7 days. The agar was chopped and 
extracted with ethyl acetate/methanol (4/1) in the 
presence of 1% acetic acid (3 x 1000 ml). Following 
removal of the solvent under vacuum, 200 ml of ethyl 
acetate containing 1% acetic acid were added. The 
precipitate was filtered and discarded, and the solvent 
was evaporated to dryness. The product mixture was 
applied to a Florisil column (Fisher Scientific) , and 
eluted with ethyl acetate containing 3% acetic acid. The 
first 100 ml fraction was collected, and concentrated 
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c?own to 5 ml. 1 ml methanol was added, and the mixture 
was kept at 4°C overnight. The precipitate was collected 
by filtration, and washed with ethyl acetate to give 850 
mg of pure product. R f = 0.48 (ethyl acetate with 1% 
acetic acid) Results from NMR spectroscopy on SEK4 are 
reported in Table 4. FAB HRMS (NBA), M + H +, calculated 
m/e 319.0818, observed m/e 319.0820. 

To produce SEK15 (13) and SEKlSb (16), 
CH999/pSEK15 was grown on 90 agar plates, and the product 
was extracted in the same manner as SEK4 . The mixture 
was applied to a Florisil column (ethyl acetate with 5% 
acetic acid) , and fractions containing the major products 
were combined and evaporated to dryness. The products 
were further purified using preparative C-18 reverse 
phase HPLC (Beckman) (mobile phase: acetonitrile/water - 
1/10 to 3/5 gradient in the presence of 1% acetic acid) . 
The yield of SEK15 , (13), was 250 mg. R f = 0.41 (ethyl 
acetate with 1% acetic acid) . Results from NMR 
spectroscopy on SEK4 are reported in Table 4. FAB HRMS 
(NBA), M + H+, calculated m/e 385.0923, observed m/e 
385.0920. 

[l,2- i3 C 2 ] acetate feeding experiments* Two 2 
1 flasks each containing 400ml of modified NMP medium 
(Strauch, E. et al. Hoi. Microbiol. (1991) 5:289) were 
inoculated with spores of 5. coelicolor CH9 9 9/pRMl8, 
CH999/pSEK4 or CH999/pSEK15 , and incubated in a shaker at 
30 degrees C and 300 rpm. To each flask, 50mg of sodium 
[1,2- 13 C 2 ] acetate (Aldrich) was added at 72 and 96 hrs. 
After 120 hrs, the cultures were pooled and extracted 
with two 500 ml volumes of ethyl acetate/1% acetic acid. 
The organic phase was kept and purification proceeded as 
described above. 13 C NMR data indicate approximately a 
2-3% enrichment for the CH999/pRM18 product; a 0.5-1% 
enrichment for SEK4 and a 1-2% enrichment for SEK15. 
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KMR Spectroscopy ♦ All spectra were recorded on 
a Varian XL-400 except for HETCOR analysis of RM18 (10) 
(Figure 8), which was performed on a Nicolet NT-360. 13 c 
spectra were acquired with continuous broadband proton 
decoupling. , For NOE studies of RM18 (10), the 
one-dimensional difference method was employed. All 
compounds were dissolved in DMSO-d 6 (Sigma, 99+ atom % D) 
and spectra were referenced internally to the solvent. 
Hydroxyl resonances were identified by adding D 2 0 
(Aldrich, 99 atom % D) and checking for disappearance of 
signal. 

Example 1 
Production of S. coelicolor CH999 

An 5. coelicolor host cell, genetically 
engineered to remove the native act gens cluster and 
termed CH999, was constructed using S. coelicolor CHI 
(Khosla, C. Molec. Microbiol. (1992) 6:3237), using the 
strategy depicted in Figure 2. (CHI is derived from S. 
coelicolor B385 (Rudd, B. A.M. Genetics of Pigmented 
Secondary Metabolites in Streptomyces coelicolor (1978) 
Ph.D. Thesis, University of East Anglia, Norwich, 
England.) CHI includes the act gene cluster which codes 
for enzymes involved in the biosynthesis and export of 
the polyketide antibiotic actinorhodin. The cluster is 
made up of the PKS genes, flanked by several post-PKS 
biosynthetic genes including those involved in 
cyclization, aromatization, and subsequent chemical 
tailoring (Figure 2A) . Also present are the genes 
responsible for transcriptional activation of the act 
genes. The act gene cluster was deleted from CHI using 
homologous recombination as described in Khosla, C. et 
al. Molec. Microbiol. (1992) 6:3237. 

In particular, plasmid pLRermEts (Figure 2B) 
was constructed with the following features: a Col£I 



a 
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replicon from pBR322, the temperature sensitive replicon 
from pSG5 (Muth, G. et al. Hoi. Gen. Genet. (1989) 
219 :341) , ampicillin and thiostrepton resistance markers, 
and a disruption cassette including a 2 kb BamHI/XhoX 
5 fragment from the 5' end of the act cluster, a 1*5 kb 

ermE fragment (Khosla, C. et al. Molec. Microbiol. (1992) 
.6:3237), and a 1.9 kb SphX/Pstl fragment from the 3' end 
of the act cluster* The 5' fragment extended from the 
Bamttl site 1 (Kalpartida, F. and Hopwood, D.A. Nature 

p 10 (1984) 309:462; Malpartida, F. and Hopwood, D.A. Mol . 

% Gen. Genet. (1986) 205 : 66) downstream to a Xhol site. 

{ i% The 3' fragment extended from PstI site 20 upstream to 

jy SphI site 19.2 (Fernandez-Moreno, M.A. et al. J. Biol. 

ijl Chem. (1992) 267:19278). The 5' and 3' fragments (shown 

|lj 15 as hatched DNA in Figure 2) were cloned in the same 

•I 

Wv relative orientation as m the act cluster. CHI was 

m 

Mv. transformed with pLRermEts. The plasmid was subsequently 

j L. cured from candidate transf ormants by streaking 

| H 

;K non-selectively at 39°C. Several colonies that were 

V<& 

y. 20 lincomycin resistant, thiostrepton sensitive, and unable 

to produce actinorhodin, were isolated and checked via 

5 Southern blotting. One of them was designated CH999. 



Exam ple 2 

25 Production of the Recombinant Vector pRM5 

pRM5 (Figure 3) was the shuttle plasmid used 
for expressing PKSs in CH999. It includes a Col£I 
replicon to allow genetic engineering in E. coli, an 
appropriately truncated SCP2* (low copy number) 

30 Streptomyces replicon, and the actII-ORF4 activator gene 
from the act cluster, which induces transcription from 
act promoters during the transition from growth phase to 
stationary phase in the vegetative mycelium* As shown in 
Figure 3, pRM5 carries the divergent actl/actlll promoter 

35 pair, together with convenient cloning sites to 
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facilitate the insertion of a variety of engineered PKS 
genes downstream of both promoters. pRM5 lacks the par 
locus of SCP2* ; as a result the plasmid is slightly 
unstable (approx. 2% loss in the absence of 
thiostrepton) . This feature was deliberately introduced 
in order to allow for rapid confirmation that a phenotype 
of interest could be unambiguously assigned to the 
plasmid- borne mutant PKS. The recombinant PKSs from pRM5 
are expressed approximately at the transition from 
exponential to stationary phase of growth, in good 
yields. 

pRM5 was constructed as follows. A 10.5 kb 
Sphl/HindXII fragment from pIJ903 (containing a portion 
of the fertility locus and the origin of replication of 
SCP2* as well as the colEI origin of replication and the 
^-lactamase gene from pBK327) (Lydiate, D.J. Gene (1985) 
3_5:223) was ligated with a 1.5 kb Hindlll/SphX tsr gene 
cassette to yield pRMl. pRM5 was constructed by 
inserting the following two fragments between the unique 
Hindlll and £coRI sites of pRMl; a 0.3 kb 
HindXII/Hpal (blunt) fragment carrying a transcription 
terminator from phage fd (Khosla, C. et al. Molec. 
Microbiol. (1992) 6:3237), and a 10 kb fragment from the 
act cluster extending from the Ncol site (1 kb upstream 
of the ac£II-ORF4 activator gene) (Hallam, S.E. et al. 
Gene (1988) 74:305; Fernandez-Moreno, M.A. et al. Cell 

(1991) 66:769; Caballero, J.L. Mol . Gen. Genet. (1991) 
230 :401) to the PstI site downstream of the actl-VII-IV 
genes (Fernandez-Moreno, M.A. et al. J. Biol. Cher?. 

(1992) 267:19278) . 

To facilitate the expression of any desired 
recombinant PKS under the control of the actl promoter 
(which is activated by the acfcII-GRF4 gene product) , 
restriction sites for PacX, NsiX, XbaX, and PstI were 
engineered into the act DNA in intercistronic positions. 
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In pRM5, as well as in all other PKS expression plasmids 
described here, ORFl, 2, and 3 alleles were cloned 
between these sites as cassettes engineered with their 
own RBSs. 

In particular, in most naturally occurring 
aromatic polyketide synthase gene clusters in 
actinomycetes , ORFl and ORF2 are translationally coupled* 
In order 'to facilitate construction of recombinant PKSs, 
the 0RF1 and ORF2 alleles used here were cloned as 
independent (uncoupled) cassettes. For act ORFl, the 
following sequence was engineered into pRM5: 
CCACCGGACGAACGCATCGATTAATTAAGGAGGACCATCATG/ where the 
boldfaced sequence corresponds to upstream DNA from the 
actl region, TTAATTAA is the Pad recognition site, and 
ATG is the start codon of act ORF1. The following 
sequence was engineered between act ORF1 and ORF2: 
NTGAATGCATGGAGG AGCCATCATG , where TGA and ATG are the stop 
and start codons of ORF1 and ORF2, respectively, ATG CAT 
is the Nsil recognition site, and the replacement of N (A 
in act DNA, A or G in alleles from other PKSs) with a C 
results in translational decoupling. The following 
sequence was engineered downstream of act 0RF2: 
TAATCTAGA , where TAA is the stop codon, and TCTAGA is. 'the 
XJbal recognition site. This allowed fusion of act ORFl 
and 0RF2 (engineered as above) to an Xbal site that had 
been engineered upstream of act ORF3 (Khosla, C. et al. 
Molec. Microbiol. (1992) 6:3237), As a control, pRM2 was 
constructed, identical to pRM5, but lacking any of the 
engineered sequences. ORFl and 0RF2 in pRM2 are 
translationally coupled. Comparison of the product 
profiles of CH999/pRM2 and CH999/pRM5 revealed that the 
decoupling strategy described here had no detectable 
influence on product distribution or product levels. 



Docket No. 00-0002.21 
PATENT 

-46- 



Ex ample 3 

Polvketides Produced using CH999 Transformed with pRMS 
Plasmid pRM5 was introduced into S. coelicolor 
CH999 using standard techniques. (See, e.g., Sambrook, 
et al. Molecular Cloning: A Laboratory Manual (Current 
Edition.) CH999 transformed with pRM5 produced a large 
amount of yellowish-brown material. The two most 
abundant products were characterized by NHR and mass 
spectroscopy as aloesaponarin II (2) (Bartel, P.L. et al. 
J". Bacterid. (1990) 172 :4816) and its carboxylated 
analog, 3 , 8-dihydroxy-l-methylanthraquinone-2-carboxylic 
acid (1) (Cameron, D.W. et al. Liebigs Ann. Chem. (1989) 
7:699) (Figure 4). It is presumed that 2 is derived from 
1 by non-enzymatic decarboxylation (Bartel, P.L. et al. 
J*. Bacterid. (1990) 122:4816). Compounds 1 and 2 were 
present in approximately a 1:5 molar ratio. 
Approximately 100 irig of the mixture could be easily 
purified from 1 1 of culture. The CH999/pRM5 host-vector 
system was therefore functioning as expected to produce 
significant amounts of a stable, only minimally modified 
polyketide metabolite. The production of 1 and 2 is 
consistent with the proposed pathway of actinorhodin 
biosynthesis (Bartel, P.L. et al. J. Bacterid . (1990),- 
172.: 4816) . Both metabolites, like the actinorhodin 
backbone, are derived from a 16-carbon polyketide with a 
single ketoreduction at C-9. 

When CH999 was transformed with pSEK4 f 
identical to pRM5 except for replacement of a 140 bp 
Sphl/Sall fragment within the act KR gene by the 
Sphl/Sall fragment from pUC19, the resulting strain 
produced abundant quantities of the aromatic polyketide 
SEK4 (12) . The exact structure of this product is 
slightly different from desoxyerythrolaccin (Bartel, P.L. 
et al. J", Bacteriol. (1990) 172:4816). However, in vivo 
isotopic labeling studies using 1,2- 13 C 2 - labeled acetate 
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confirmed that the polyketide backbone is derived from 8 
acetates. Moreover, the aromatic region of the *H 
spectrum, as well as the 13 C NMR spectrum of this 
product, are consistent with a tricyclic structure 
similar 'to 1/ but lacking any ketoreduction (see Table 
4). 

Example 4 

Construction and Analysis of Hybrid Polyketide Synthases 

A. Construction of hybrid PKSs including components from 
act, qra and tcm PKSs 

Figure 1 shows the PKSs responsible for 
synthesizing the carbon chain backbones of actinorhodin 
(3) , granaticin (4) , and tetracenoxnycin (5) (structures 
shown in Figure 5) which contain homologous putative 
KS/AT and ACP subunits, as well as the 0RF2 product. The 
act and grra PKSs also have KRs, lacking in the tcm PKS. 
Corresponding proteins from each cluster show a high 
degree of sequence identity. The percentage identities 
between corresponding PKS proteins in the three clusters 
are as follows: KS/AT: act/gra 76, act/ tcm 64, gra/tcm 
70; CLF: act/gra 60, act /tcm 58, gra/tcm 54; ACP: act/gra 
60, act/ tcm 43, gra/tcm 44. The act and gra PKSs 
synthesize identical 16-carbon backbones derived from 8 
acetate residues with a ketoreduction at C-9 (Figure 6) . 
In contrast, also as shown in Figure 6, the tcm 
polyketide backbone differs in overall carbon chain 
length (20 instead of 16 carbons) , lack of any 
ketoreduction, and regiospecif icity of the first 
cyclization, which occurs between carbons 9 and 14, 
instead of carbons 7 and 12 for act and gra. 

In an attempt to generate novel polyketides, 
differing in a range of properties, as well as to 
elucidate aspects of the programming of aromatic PKSs, a 
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systematic series of minimal PKS gene clusters, using 
various permutations of the 0RF1 (encoding the KS/AT 
subunit) , ORF2 (encoding the CLF subunit) and ORF3 
(encoding the ACP subunit) gene products from the act, 
gra and 'tern gene clusters were cloned into pRM5 in place 
of the existing act genes, as shown in Table l. The 
resulting plasmids were used to transform CH999 as above. 

Analysis of the products of the recombinant 
PKSs containing various permutations among the KS/AT, 
ORF2 product, and ACP subunits of the PKSs (all 
constructs also containing the act KR, cyclase, and 
dehydratase genes) indicated that the synthases could be 
grouped into three categories (Table 1) : those that did 
not produce any polyketide; those that produced compound 
1 (in addition to a small amount of 2) ; and those that 
produced a novel polyketide 9 (designated RM20) (Figure 
6) . The structure of 9 suggests that the polyketide 
backbone precursor of this molecule is derived from 10 
acetate residues with a single ketoreduction at the C-9 
position. 

In order to investigate the influence of the 
act KR on the reduction and cyclization patterns of a 
heterologous polyketide chain, pSEK15 was also 
constructed, which included tern ORFs 1-3, but lacked the 
act KR. (The deletion in the act KR gene in this 
construct was identical to that in pSEK4 . ) Analysis of 
CH999/pSEK15 showed the 2 0 carbon chain product, SEK15 
(13) which resembled, but was not identical to, 
tetracenomycin C or its shunt products. NHR spectroscopy 
was also consistent with a completely unreduced 
decaketide backbone (see Table 4) . 

All act /gra hybrids produced compound 1, 
consistent with the identical structures of the presumed 
actinorhodin and granaticin polyketides. In each case 
where a product could be isolated from a tcm/act hybrid, 
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the chain length of the polyketide was identical to that 
of the natural product corresponding to the source of 
0RF2. This implies that the 0PF2 product, and not the 
ACP or KS/AT, controls carbon chain length. Furthermore, 
since all polyketides produced by the hybrids described 
here, except the ones lacking the KR (CH999/pSEK4 and 
CH999/pSEK15) , underwent a single ketoreduction, it can 
be concluded that: (i) the KR is both necessary and 
sufficient for ketoreduction to occur; (ii) this 
reduction always occurs at the C-9 position in the final 
polyketide backbone (counting from the carboxyl end of 
the chain); and (iii) while unreduced polyketides may 
undergo alternative cyclization patterns, in nascent 
polyketide chains that have undergone ketoreduction, the 
regiochemistry of the first cyclization is dictated by 
the position of the resulting hydroxyl, irrespective of 
how this cyclization occurs in the non-reduced product. 
In other words, the tern PKS could be engineered to 
exhibit- new cyclization specificity by including a 
ketoreductase . 

A striking feature of RM20 (9) is the pattern 
of cyclizations following the first cyclization. 
Isolation of mutactin (6) fron an actvil mutant suggested 
that the actVII product and its tern homolog catalyze the 
cyclization of the second ring in the biosynthesis of 
actinorhodin (3) and tetracenomycin (5), respectively 
(Sherman, D.H. et al. Tetrahedron (1991) 47:6029; 
Summers, R.G. et al. J . Bacteriol . (1992) 174:1810). The 
cyclization pattern of RM20 (9) is different from that of 
1 and tetracenomycin Fl, despite the presence of the 
actVII gene on pRK20 (9) . It therefore appears that the 
act cyclase cannot cyclize longer polyketide chains. 

Unexpectedly, the strain containing the minimal 
torn PKS alone (CH999/pSEK33) produced two polyketides, 
SEK15 (13) and SEKlSb (16), as depicted in Figure 8, in 
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approximately equal quantities. Compounds (13) and (16) 
were also isolated from CH999/pSEK15 , however, greater 
quantities of, compound (13) were isolated this construct 
than of compound (16) . 

SEK15b is a novel compound, the structure of 
which was elucidated through a combination of NMR 
spectroscopy, sodium [1,2- 13 C 2 ] acetate feeding 
experiments and mass spectroscopy. Results from *H and 
13 c NMR indicated that SEKlSb consisted of an unreduced 
anthraquinone moiety and a pyrone moiety. Sodium [1,2- 
13 C 2 ] -acetate feeding experiments confirmed that the 
carbon chain of SEKlSb was derived from 10 acetate units. 
The coupling constants calculated from the 13 C NMR 
spectrum of the enriched SEK15b sample facilitated peak 
assignment. Fast atom bombardment (FAB) mass 
spectroscopy gave a molecular weight of 381 (M + H + ) , 
consistent with C 20 H 12 O 8 . Deuterium exchange was used to 
confirm the presence of each hydroxyl in SEKlSb. 

In order to identify the degrees of freedom 
available in vivo to a nascent polyketide chain for 
cyclizing in the absence of an active cyclase, 
polyketides produced by recombinant S. coelicolor 
CH999/pRM37 (McDaniel et al. (1993), supra) were 
analyzed. The biosynthetic enzymes encoded by pRM37 are 
the tern ketosynthase/acyltransf erase (KS/AT) , the tcin 
chain length determining factor (CLF) , the tern acyl 
carrier protein (ACP) , and the act ketoreductase (KR) . 

Two novel compounds, RM20b (14) and RM20c (15) 
(Figure 8) were discovered in the culture medium of 
CH999/pRM37, which had previously yielded RM20 (9). The 
relative quantities of the three compounds recovered were 
3:7:1 (RM20:RM20b:RM20c) * The structures of (14) and 
(15) were elucidated through a combination of mass 
spectroscopy, NMR spectroscopy and isotope labeling 
experiments. 2 H and 13 C NMR spectra suggested that RM2 0b 
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and RM20c were diastereomers, each containing a pyrone 
moiety. Optical rotations ([a] D 20 were found to by 
+210.8° for RM20b (EtOH, 0.55%) and +78.0° for KM20c 
(EtOH, 0.33%) » Sodium [l, 2-»C 2 ] -acetate feeding 
experiments confirmed that the carbon chain of RM20b (and 
by inference KM20c) was derived from 10 acetate units. 
Deuterium exchange studies were carried out in order to 
identify l H NKR peaks corresponding to potential hydroxyl 
groups on both RM20b and RM20c. Proton coupling 
constants were calculated from the results of J H NMR and 
one-dimensional decoupling experiments. In particular, 
the coupling pattern in the upfield region of the 
spectrum indicated a 5-proton spin system of two 
methylene groups surrounding a central carbinol methine 
proton. High resolution fast atom bombardment (FAB) mass 
spectroscopy gave molecular weights of (519.0056) (M = 
Cs + ) for RM2 0b and 387.1070 (M + H + ) for RM20c, which is 
consistent with C 20 H ls O 8 £K + Cs + , 519,0056; M + H + , 
387.1080). Based on theses data, structures (14) and 
(15) (Figure 8) were assigned to RM20b and RM20c, 
respectively. 

Data from a H and 13 C NMR indicated that the 
coupling constants between H-9 and the geminal protons on 
C-8 were 12.1 or 12.2 and 2.5 or 2.2 Hz for RM20b or 
RM20c, respectively. The coupling constants between H-9 
and the geminal protons on C-1Q were 9.6 or 9.7 and 5.7 
or 5.8 Hz for Rm20b or RM20c, respectively. These values 
are typical of a J 3/a (J 9a#8a or J 9a/10a ) and J a/6 (J 9a/8e 
or J$a t xoe} coupling pattern, and indicate an axial 
position for H-9 in both RM2 0b and Rm20c. In contrast, 
the chemical shifts of the C-7 hydroxyls on the two 
molecules were 16.18 and 6.14 ppm for RM2 0b and RM20c, 
respectively. These values indicate a hydrogen bond 
between the C-7 hydroxyl and a suitably positioned 
acceptor atom in RM20b, but not in RM2 0c. The most 
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likely candidate acceptor atoms for such hydrogen bonding 
are the C-13 carbonyl oxygen in the conjugated pyrone 
ring system, or the bridge oxygen in the isolate pyrone 
ring. The former appears to be likely as it would be 
impossible td discriminate between (14) and (15) if the 
latter were the case. Furthermore, comparison of 13 C NHR 
spectra of RM20b and RH20c revealed that the greatest 
differences between (14) and (15) were in the chemical 
shifts of the carbons that make up the conjugated pyrone 
ring (+5.9, -6.1, +8.9,-7.8 and +2.0 ppm for C-ll, c-12, 
C-13, C-14 and C-15, respectively) . Such a pattern of 
alternating upfield and downfield shifts can be explained 
by the fact that the C-7 hydroxyl is hydrogen-bonded to 
the C-13 carbonyl, since hydrogen bonding would be 
expected to reduce the electron density around C-ll, C-13 
and C-15, but increase the electron density around C-12 
and C-14. To confirm the C-7/C-13 hydrogen bond 
assignment, the exchangeable protons RM2 Ob and RM20c were 
replaced with deuterium (by incubating in the presence of 
D 2 0) , and the samples were analyzed by 13 C NMR. The C-13 
peak in RM20b, but not RM20c, underwent an upfield shift 
(1.7 ppm) , which can be explained by a weaker C-7/C-13 
non-covalent bond in RM20b when hydrogen is replace yith 
deuterium. In order to form a hydrogen bond with the C- 
13 carbonyl, the C-7 hydroxyl of RN20b must occupy the 
equatorial position. Thus, it can be inferred that the 
C-7 and C-9 hydroxyls are on the same face (syn) of the 
conjugated ring system in the major isomer (RM2 0b) , 
whereas they are on opposite sides (anti) in the minor 
isomer (RM20c) . 

No polyketide could be detected in CH999/pRM15, 
/pRM35, and /pRM3 6. Thus, only some ORF1-0RF2 
combinations are functional. Since each subunit was 
functional in at least one recombinant synthase, protein 
expression/folding problems are unlikely to be the cause. 
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Instead, imperfect or inhibitory association between the 
different subunits of these enzyme complexes, or 
biosynthesis of (aborted) short chain products that are 
rapidly degraded, are plausible explanations* 

B. Construction of hybrid PKSs including components from 
act and fren PKSs 

Streptomyces roseofulvus produces both 
frenolicin B (7) (Iwai, Y. et al. J. Antibiot. (1973) 
31:959) and nanaonycin A (8) (Tsuzuki, K. et al. J. 
Antibiot. (1986) 39:1343). A 10 kb DNA fragment 
(referred to as the fren locus hereafter) was cloned from 
a genomic library of S. roseofulvus (Bibb, M.J. et al. 
submitted) using DNA encoding the KS/AT and KR components 
of the act PKS of S. coelicolor A3 (2) as a probe 
(Malpartida, F. et al. Nature (1987) 325:818). (See 
Figure 7 for structural representations.) DNA 
sequencing of the fren locus revealed the existence of 
(among others) genes with a high degree of identity to 
those encoding the act KS/AT, CLF, ACP, KR, and cyclase. 

To produce the novel polyketides, the 0RF1, 2 
and 3 act genes present in pRM5 were replaced with the 
corresponding fren genes, as shown in Table 2. S. 
coelicolor CH999, constructed as described above, was 
transformed with these plasmids. (The genes encoding the 
act KR, and the act cyclase were also present on each of 
these genetic constructs.) Based on results from similar 
experiments with act and tern PKSs, described above, it 
was expected that the act KR would be able to reduce the 
products of all functional recombinant PKSs, whereas the 
ability of the act cyclase to catalyze the second 
cyclization would depend upon the chain length of the 
product of the fren PKS. 

The results summarized in Table 2 indicate that 
most of the transformants expressed functional PKSs, as 
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assayed by their ability to produce aromatic polyketides. 
Structural analysis of the major products revealed that 
the producer strains could be grouped into two 
categories: those that synthesized compound 1 (together 
with a smaller amount of its decarboxylated side-product 
(2), and those that synthesized a mixture of compounds i, 
10 and 11 in a roughly 1:2:2 ratio. (Small amounts of 2 
were also found in all strains producing 1.) Compounds l 
and 2, had been observed before as natural products, and 
were the metabolites produced by a PKS consisting 
entirely of act subunits, as described in Example 3. 
Compounds 10 and ll (designated RM18 and RMlSb, 
respectively) are novel structures whose chemical 
synthesis or isolation as natural products has not been 
reported previously. 

The structures of 10 and 11 were elucidated 
through a combination of mass spectroscopy, nmr 
spectroscopy, and isotope labeling experiments. The 2 H 
and 13 C spectral assignments are shown in Table 3, along 
with 13 c- 13 C coupling constants for 10 obtained through 
sodium [1,2- 13 C 2 ] acetate feeding experiments (described 
below) . Unequivocal assignments for compound 10 were 
established with ID nuclear Overhauser effect (NOE) and 
long range heteronuclear correlation (HETCOR) studies. 
Deuterium exchange confirmed the presence of hydroxyls at 
C-15 of compound 10 and C-13 of compound 11. Field 
desorption mass spectrometry (FD-MS) of 2 revealed a 
molecular weight of 282, consistent with C 17 H I4 0 4 
(282.2952) . 

Earlier studies showed that the polyketide 
backbone of 2 (Bartel, P.L. et al. J. Bacteriol . (1990) 
172 : 4816) (and by inference, 1) is derived from iterative 
condensations of 8 acetate residues with a single 
ketoreduction at C-9. It may also be argued that 
nanaomycin (8) arises from an identical carbon chain 
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backbone. Therefore, it is very likely that nanaomycin 
is a product of the fren PKS genes in S. roseofulvus, 
Regiospecificity of the first cyclization leading to the 
formation of 1 is guided by the position of the 
ketoreductiqn, whereas that of the second cyclization is 
controlled by the act cyclase (Zhang, H.L. et al. J. Org. 
Chem. (1990) 55:1682). 

In order to trace the carbon chain backbone of 
RM18 (10), in vivo feeding experiments using [1,2- 13 C 2J 
acetate were performed on CH999/pRM18, followed by KMR 
analysis of labelled RM18 (10) . The 13 C coupling data 
(summarized in Table 3) indicate that the polyketide 
backbone of RM18 (10) is derived from 9 acetate residues, 
followed by a terminal decarboxylation (the C-2 i3 C 
resonance appears as an enhanced singlet) , which 
presumably occurs non-enzymatically . Furthermore, the 
absence of a hydroxyl group at the c-9 position suggests 
that a ketoreduction occurs at this carbon. Since these 
two features would be expected to occur in the putative 
frenolicin (7) backbone, the results suggest that, in 
addition to synthesizing nanaomycin, the fren PKS genes 
are responsible for the biosynthesis of frenolicin in S. 
roseofulvus . This appears to be the first unambiguous 
case of a PKS with relaxed chain length specificity^ 
However, unlike the putative backbone of frenolicin, the 
C-17 carbonyl of RM1S (10) is not reduced. This could 
either reflect the absence from pRM18 of a specific 
ketoreductase, dehydratase, and an enoylreductase 
(present in the fren gene cluster in S. roseofulvus) , or 
it could reflect a different origin for carbons 15-18 in 
frenolicin. 

Regiospecificity of the first cyclization 
leading to the formation of RM18 (10) is guided by the 
position of the ketoreduction; however the second 
cyclization occurs differently from that in 7 or 1, and 
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is similar to the cyclization pattern observed i.i RM20 
(9) , a decaketide produced by the tcm PKS, as described 
above. Therefore, as in the case of RM20 (9), it could 
be argued that the act cyclase cannot catalyze the second 
cyclization of the RM18 precursor , and that its 
subsequent cyclizations, which presumably occur 
non-enzymatically, are dictated by temporal differences 
in release of different portions of the nascent 
polyketide chain into an aqueous environment , In view of 
the ability of CH999/pRM18 (and CH999/pRM34) to produce 
1, one can rule out the possibility that the cyclase 
cannot associate with the fren PKS (KS/AT, CLF, and ACP) . 
A , more likely explanation is that the act cyclase cannot 
recognize substrates of altered chain lengths. This 
would also be consistent with the putative biosynthetic 
scheme for RM20 (9) . 

A comparison of the product profiles of the 
hybrid synthases reported in Table 2 with analogous 
hybrids between act and tcm PKS components (Table 1) 
support the hypothesis that the ORF2 product is the chain 
length determining factor (CLF) . Preparation of 
compounds 9, 10 and 11 via cyclization of enzyme-bound 
ketides is schematically illustrated in Figure 8. 

Example 5 

Construction and Analysis of Modular Polvketide Synthases 

Expression plasmids containing recombinant 
modular DEBS PKS genes were constructed by transferring 
DNA incrementally from a temperature-sensitive "donor" 
plasmid, i.e., a plasmid capable of replication at a 
first, permissive temperature and incapable of 
replication at a second, non-permissive temperature, to a 
"recipient" shuttle vector via a double recombination 
event, as depicted in Figure 10. pCK7 (Figure 11), a 
shuttle plasmid containing the complete eryA genes, which 
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were originally cloned from pSl (Tuan et al. (1990) Gene 
90:21), was constructed as follows. A 25.6 kb Sphl 
fragment from pSl was inserted into the Sphl site of 
pMAK705 (Hamilton et al. (1989) J\ Bacterial. 171:4617) 
to givepCKS (Cm R ) , a donor plasmid containing eryAII, 
eryAIII, and the 3' end of eryAJ. Replication of this 
temperature-sensitive pSClOl derivative occurs at 30 °C 
but is arrested at 44 °C. The recipient plasmid, pCK5 
(Ap R , Tc R ) , includes a l2.2kb eryA fragment from the 
eryAI start codon (Caffrey et al. (1992) FEBS Lett. 
304:225) to the Xcml site near the beginning of eryAII, a 
1.4 kb EcoRI - Bsml pBR322 fragment encoding the 
tetracycline resistance gene (Tc) , and a 4.0 kb NotX - * 
EcoRI fragment from the end of eryAIII. Pad , Ndel, and 
ribosome binding sites were engineered at the eryAX start 
codon in pCK5. pCK5 is a derivative of pRM5 (KcDaniel et 
al. (1993), supra). The 5' and 3' regions of homology 
(Figure 10, striped and unshaded areas) are 4.1 kb and 
4.0 kb, respectively. MC1061 E. coli was transformed 
(see, Sambrook et al., supra) with pCK5 and pCK6 and 
subjected to carbenicillin and chloramphenicol selection 
at 30°C. Colonies harboring both plasmids (Ap R , Cm R ) 
were then restreaked at 44 °C on carbenicillin and 
chloramphenicol plates. Only cointegrates formed by a 
single recombination event between the two plasmids were 
viable. Surviving colonies were propagated at 30 °C under 
carbenicillin selection, forcing the resolution of the 
cointegrates via a second recombination event. To enrich 
for pCK7 recombinants, colonies were restreaked again on 
carbenicillin plates at 44°C. Approximately 20% of the 
resulting colonies displayed the desired phenotype (Ap R , 
Tc s ,Cm s ). The final pCK7 candidates were thoroughly 
checked via restriction mapping. A control plasmid, 
pCK7f, which contains a frameshift error in eryAI, was 
constructed in a similar manner. pCK7 and pCK7f were 
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transformed into E. coli ET12567 (MacNeil (1988) J. 
Bacteriol. 170:5607) to generate unmethylated plasmid DNA 
and subsequently moved into Streptomyces coelicolor CH999 
using standard protocols (Hopwood et al. (1985) Genetic 
5 manipulation Streptomyces. A laboratory manual. The 
John Xnnes Foundation: Norwich) . 

Upon growth of CH999/pCK7 on R2YE medium, the 
organism produced abundant quantities of two polyketides 
(Figure X) . The addition of propionate (300 mg/L) to the 
10 growth medium resulted in approximately a two-fold 

increase in yield of polyketide product. Proton and 13 c 
NMR spectroscopy, in conjunction with propionic-l- 13 C 
acid feeding experiments, confirmed the major product as 
6dEB (17) (> 40 mg/L) . The minor product was identified 
15 as 8, 8a-deoxyoleandolide (18) (> 10 mg/L), which 
1^1 apparently originates from an acetate starter unit 

M instead of propionate in the 6dEB biosynthetic pathway. 

fe C 2 sodium acetate feeding experiments confirmed the 

Q incorporation of acetate into (IB). Three high molecular 

flj 20 weight proteins (>200 kDa) , presumably DEBS1, DEBS2, and 

DEBS3 (Caffrey et al. (1992) FSBS Lett. 304:225), were 
|J also observed in crude extracts of CH999/pCK7 via 

SDS-polyacrylamide gel electrophoresis. No polyketide 
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1)1 



25 



products were observed from CH999/pCK7f . 



Thus, novel polyketides, as well as methods for 
recombinantly producing the polyketides, are disclosed. 
30 Although preferred embodiments of the subject invention 

have been described in some detail, it is understood that 
obvious variations can be made without departing from the 
spirit and the scope of the invention as defined by the 
appended claims. 
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TABLE 1 



Plasmid 


ORF1 
(KS/AT) 


ORF2 
(CLDF) 


ORF3 
(ACP) 


Major 
Product(s) 


Backbone 
Carbon 


[jruviy 






aLl 


1 9 


1 ft 
1 O 


pHM/ 




act 


act 


1,2 


1 o 


pRM12 


act 


gra 


act 


1,2 


16 


pRM22 


act 


act 


gra 


1,2 


16 


pRMlO 


tcm 


act 


act 


1,2 


16 


pRM15 


act 


tcm 


act 


NP 




pRM20" 


* tcm 


tcm 


act 


9 


20 


pRM25 


act 


act 


tcm 


1,2 


16 


pRM35 


tcm 


act 


tcm 


NP 




pRM36 


act 


tcm 


tcm 


NP 




pRM37 


tcm 


tcm 


tcm 


9,14,15 


20 


pSEK15 


tcm 


tcm 


tcm 


13,16 


20 


pSEK'33 


tcm 


tcm 


act 


13,16 


20 


TABLE 2 


Plasmid 


ORF1 
(KS/AT) 


ORF2 
(CLDF) 


ORF3 
(ACP) 


Major 
Product(s) 


pRM5 


act 




act 


act 


1,2 


pRM8 


fren 




act 


act 


1,2 


pRM13* 


act 




fren 


act 


NP 


pRM23 


act 
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fren 


1,2 


pRM18 
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fren 


act 


1,2,10,11 


pRM32 


fren 




act 
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NP 


pRM33 


act 




fren 


fren 


NP 


pRM34 


fren 




fren 


fren 


0001,2,10, 
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